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DNA POLYMERASES HAVING IMPROVED LABELED 
NUCLEOTIDE INCORPORATION PROPERTIES 

5 Inventors: John Brandis, Curtis Bloom, and Jack Richards 

Field of the Invention 

The invention is related to DNA polymerases having mutations that alter the ability 
10 of the enzyme to incorporate labeled nucleotides into a polynucleotide molecule. 

Background 

DNA polymerases are enzymes that synthesize the formation of DNA molecules from 
deoxynucleotide triphosphates using a template DNA strand and a complementary synthesis 
1 5 primer annealed to a portion of the template. A detailed description of DNA polymerases and 

their enzymological characterization can be found in Kornberg, DNA Replication Second 
Edition . W. H. Freeman (1989). 

DNA polymerases have a variety of uses in molecular biology techniques suitable for 
both research and clinical applications. Foremost among these techniques are DNA 
20 sequencing and nucleic acid amplification techniques such as PCR (polymerase chain 

reaction). 

The amino acid sequence of many DNA polymerases have been determined. 
Sequence comparisons between different DNA polymerase have identified many regions of 
homology between the different enzymes. X-ray diffraction studies have determined the 

25 tertiary structures of Klenow fragment, T7 DNA polymerase, and Taq DNA polymerase. 

Studies of the tertiary structures of DNA polymerases and amino acid sequence comparisons 
have revealed numerous structural similarities between diverse DNA polymerases. In general, 
DNA polymerases have a large cleft that is thought to accommodate the binding of duplex 
DNA. This cleft is formed by two sets of helices, the first set is referred to as the "fingers" 

30 region and the second set of helices is referred to as the "thumb" region. The bottom of the 

cleft is formed by anti-parallel p sheets and is referred to as the "palm" region. Reviews of 
DNA polymerase structure can be found in Joyce and Steitz , Ann. Rev. Biochem 63:777- 
822 (1994). Computer readable data files describing the three-dimensional structure of some 
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DNA polymerases have been publicly disseminated. 

Fluorescently labeled nucleotides have greatly simplified and improved the utility of 
many procedures in molecular biology. The use of fluorescently labeled nucleotides for 
labeling polynucleotides in synthesis procedures, has to a large extent replaced the use of 
5 radioactive labeling. Fluorescently labeled nucleotides have been widely used in DNA 

sequencing, see Smith et al Nature 321:674-679 (1986), in PCR, and other forms of 
polynucleotide fragment analysis. 

A major problem with using fluorescently labeled nucleotides is the ability of DNA 
polymerases to discriminate against the incorporation of fluorescently labeled nucleotides. 

10 For example, the inventors have discovered that in competition assays between a TET (6- 

carboxy-4,7,2\7'-tetrachlorofluorescein) labeled T 3* dideoxynucleotide and the 
corresponding unlabeled dideoxynucleotide, Taq DNA polymerase incorporates the unlabeled 
dideoxynucleotide into DNA at least 85 times more frequently than the corresponding 
unlabeled nucleotide. This discrimination between labeled and unlabeled nucleotides has 

1 5 profound effects on procedures using DNA polymerases to label DNA. For example, much 

larger amounts of fluorescently labeled nucleotide must be used in sequencing reactions. This 
large amount of fluorescently labeled nucleotide is expensive and can generate excessive 
background fluorescence, thereby reducing the yield of sequence information. 

In view of the problems arising from the ability of DNA polymerases to discriminate 

20 against the incorporation of fluorescently labeled nucleotides, the inventors have developed 

several novel DNA polymerases that have reduced discrimination against the incorporation 
of one or more fluorescently labeled nucleotides into DNA. 



Summary 

Naturally occurring DNA polymerases preferentially incorporate unlabeled nucleotides 
over corresponding fluorescently labeled nucleotides into polynucleotides. This ability of 
DNA polymerases to discriminate against fluorescently labeled nucleotide has undesirable 
effects on many molecular biology procedures that require the enzymatic addition of 
fluorescently labeled nucleotides, e.g., labeled dideoxy terminator sequencing. The present 
invention relates to mutant DNA polymerases that exhibit reduced discrimination against 
fluorescently labeled nucleotides into polynucleotides. 

The DNA polymerases of the invention have at least one mutation in the nucleotide 
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label interaction region of the enzyme such that the mutation results in reduced discrimination 
against fluorescently labeled nucleotides. The nucleotide label interaction region of a DNA 
polymerase is formed by portions of the O-helix, (ii) the K helix, and (iii) the inter O-P helical 
loop of Taq DNA polymerase or analogous positions in other DNA polymerases. Amino acid 
residues within the nucleotide label interaction region as defined by TET (II) -ddC are E520. 
A531, L522, R523, E524, A525, H526, P527, I 528, V529, E530, K531, 1532, R536, E537, 
R573, Q582, N583, V586, R587, P589, Q592, R593, R595, D610, T612, Q613, E615, 
R636, D637, T640, F647, V654, D655, P656, L657, R659, R660, T664, E681, L682, A683, 
1684, P685, E688, F692, Q754, H784, L81 7, E820, L828, K831,and E832. The sites at 
R660, T664, and E681 are of prefered sites for introducing mutations. In a preferred 
embodiment of the invention for use with fluorescein-type dyes, a mutation is present at 
position 681 converting an E (glutamic acid) to M (methionine), i.e., E681M In a preferred 
embodiment of the invention for use with fluorescein- fluorescein energy transfer dyes a 
mutation is present at position 657 converting an L (leucine) to a G (glycine). In addition to 
15 providing mutant Taq DNA polymerases having reduced discrimination against labeled 

nucleotides, the invention includes mutants derived from a wide variety of DNA polymerases, 
both thermostable and otherwise. 

In addition to providing novel mutant DNA polymerases, the invention also provides 
polynucleotides encoding the subject mutant DNA polymerases. The polynucleotides 
20 provided may comprise expression vectors for the recombinant production of the mutant 

polymerases. The invention also includes host cells containing the subject polymerase 
polynucleotides. 

The invention also includes numerous methods of using the subject DNA polymerases. 
The subject methods involve synthesizing a fluorescently labeled polynucleotide by means of 

25 a polynucleotide synthesis reaction catalyzed by a mutant DNA polymerase that has reduced 

discrimination against incorporating labeled nucleotides into polynucleotides. The subject 
methods of polynucleotide synthesis include the step of extending a primed polynucleotide 
template with at least one fluorescent labeled nucleotide, wherein the extension is catalyzed 
by a DNA polymerase that has ^educed discrimination against labeled nucleotides into 

30 polynucleotides. The subject methods of synthesizing a fluorescently labeled polynucleotide 

may be used in a variety of methods such as Sanger sequencing and the polymerase chain 
reaction (PCR). 



-3- 

SUBSTITUTE SHEET ( rule 26 ) 



WO 98/40496 



PCT/US98/05095 



Another aspect of the invention is to provide kits for synthesizing fluorescently labeled 
polynucleotides in accordance with the methods of the invention. Kits of the invention 
comprise a mutant DNA polymerase of the invention and a fluorescently labeled nucleotide 
that exhibits reduced discrimination with respect to the mutant DNA polymerase in the kit. 

Brief Description of the Drawings 

Figure 1 is a computer model of DNA bound to Taq DNA polymerase. Amino acid 
residues that form the nucleotide label interaction site are highlighted in orange. The rest of 
the polymerase is indicated in green. The template is indicated in blue. The dye moiety of the 
labeled nucleotide is red. The remaider of the labled nucleotide is white. 

Figure 2 is plot of a next nucleotide effect assay 

Figure 3 is plot of a next nucleotide effect assay. 

Figure 4 is a representation of the structure of the fluorescently labeled nucleotide 
"TET(II>ddCTP " 

Detailed Description of Specific Embodiments of the Invention. 
Terminology 

Positions of amino acid residues within a DNA polymerase are indicated by either 
numbers or number/letter combinations. The numbering starts at the amino terminus residue. 
The letter is the single letter amino acid code for the amino acid residue at the indicated 
position in the naturally occurring enzyme from which the mutant is derived. Unless 
specifically indicated otherwise, an amino acid residue position designation should be 
construed as referring to the analogous position in all DNA polymerases, even though the 
single letter amino acid code specifically relates to the amino acid residue at the indicated 
position in Taq DNA polymerase. 

Individual substitution mutations are indicated by the form of a letter/number/letter 
combination. The letters are the single letter code for amino acid residues. The numbers 
indicate the amino acid residue position of the mutation site. The numbering system starts at 
the amino terminus residue. The numbering of the residues in Taq DNA polymerase is as 
described in U.S. Patent No. 5,079,352 (Gelfand). Amino acid sequence homology between 
different DNA polymerases permits corresponding positions to be assigned to amino acid 
residues for DNA polymerases other than Taq. Unless indicated otherwise, a given number 
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refers to position in Taq DNA polymerase. The first letter, i.e., the letter to the left of the 
number, represents the amino acid residue at the indicated position in the non-mutant enzyme. 
The second letter represents the amino acid residue at the same position in the mutant 
enzyme. For example, the term "R660D" indicates that the arginine at position 660 has been 
5 replaced by an aspartic acid residue. 

The term "discrimination" as used herein refers to the property of a DNA polymerase 
to preferentially incorporate unlabeled nucleotides over corresponding fluorescently labeled 
nucleotides into DNA, i.e., the DNA polymerase discriminates against the fluorescently 
labeled nucleotide. Preferential incorporation may be measured in an assay in which a 

10 fluorescently labeled 2'3' dideoxynucleotide and a corresponding unlabeled 2*3' 

dideoxynucleotide compete for incorporation into the same site of a polynucleotide. An 
example of such an assay can be found below in example 2. 

The term "reduced discrimination" as used herein refers to reduction in discrimination 
against incorporation of a fluorescently labeled nucleotides in a mutant DNA polymerase as 

15 compared to the parent enzyme. A reduction in discrimination may be described 

quantitatively by reference to the selectivity assays in Example 2 or reference to other assays 
providing for measurement of the same properties of the polymerase. A reduction in 
selectivity number as measured by the selectivity assays is a reduction in discrimination and 
may be expressed by a ratio of selectivity numbers. For example, a mutant DNA polymerase 

20 with a selectivity number of 8 would have a 10-fold reduction in discrimination when 

compared with a parent DNA polymerase having a selectivity number of 80. 

The term "parent" or "parent enzyme" is used to distinguish a mutant DNA 
polymerase from the DNA polymerase that the mutant enzyme was derived from. Thus any 
naturally occurring DNA polymerase may be referred to as parent enzyme. A first DNA 

25 polymerase having mutations with respect to a naturally occurring enzyme is also be referred 

to as a parent enzyme with respect to a second DNA polymerase having additional mutations. 

The term "discrimination reducing mutations 1 ' refers to mutations in the nucleotide 
label interaction region of a DNA polymerase that result in reduced discrimination against the 
incorporation of fluorescently labeled nucleotides. The term is used to distinguish mutations 

30 in a DNA polymerase, including mutations in the nucleotide label interaction region, that do 

not reduce discrimination against fluorescently labeled nucleotides from mutations that do 
reduce discrimination. 
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The term "nucleotide" as used herein, unless specifically noted otherwise, is used 
broadly to refer to both naturally occurring nucleotide and a variety of analogs including 2\3' 
dideoxynucleotides. 

The term "fluorescein-type dyes" refers to a class of xanthene dye molecules which 
5 include the following fused three-ring system: 




where a wide variety of substitutions are possible at each deoxy ring position. A particularly 
preferred subset of fluorescein-type dyes include the 4,7,-dichorofluoresceins (Menchen). 

10 Examples of fluorescein-type dyes used as fluorescent labels in DNA sequencing methods 

include 6-carboxyfluorescein (6-FAM), 5-carboxyfluorescein (5-FAM), 6 -carboxy-4,7,2',7'- 
tetrachlorofluorescein (TET), 6-carboxy-4,7,2',4',5 , ,7 , -hexachlorofluorescein (HEX), 5-(and 
6)carboxy-4\5'-dichloro-2'7 , -dimethoxyfluorescein (JOE), and 5-carboxy-2\4\5\7'- 
tetrachlorofluorescein (ZOE). Many times the designation -1 or -2 is placed after an 

15 abbreviation of a particular dye, e.g., HEX-1. The and "-2" (or "F and "II") 

designations indicate the particular dye isomer being used. The 1 and 2 isomers are defined 
by the elution order (the 1 isomer being the first to elute) of free dye in a reverse-phase 
chromatographic separation system utilizing a C-8 column and an elution gradient of 1 5% 
acetonitrile/85% 0.1 M triethylammonium acetate to 35% acetonitrile / 65% 0.1 M 

20 triethylammonium acetate. 

The term "alkynylamino type linker" refers to an alkynylamino linker of the type as 
described in U.S. Patent No. 5,047,519 (Hobbs), U.S. Patent No. 5,151,507 (Hobbs), and 
U.S. Patent Application No. 08/696,808, filed August 13, 1996. Additional alkynylamino 
type liners are described in U. S. Patent Application No. 08/833,855, filed April 10, 1997. 

25 The term "TET(II>ddCTP" refers to the fluorescently labeled nucleotide of the 

structure indicated in figure 4. 

The term "fluorescence energy transfer dye" refers to dye moieties joined by a linker 
that permits fluorescence energy transfer between the two dye moieties. For use in chain 
termination sequencing, the linker is sufficiently small and of the proper shape and orientation 
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to permit a DNA polymerase to incorporate a nucleotide triphosphate labeled with the dye 
interest. Examples of energy transfer dyes can be found in European Patent Application No. 
EP 0 805 140, U.S. Patent Application No. 08/642,330 (filed May 3, 1996), and U.S. Patent 
Application No. 08/726,462 (filed October 4, 1996). 

The term "mutation" as used herein refers to a change in amino acid residue at a 
specific location of a protein. The change in amino acid residue is a change defined with 
respect to a naturally occurring protein. A protein having a mutation may be referred to as 
a "mutant" protein. 

Embodiments of the Invention 

The present invention relates to DNA polymerases containing mutations that reduce 
the ability of the polymerase to discriminate against the incorporation of fluorescently labeled 
nucleotides into polynucleotides. These mutations are in a region of the DNA polymerase 
molecule referred to herein as "the nucleotide label interaction region." The nucleotide label 
interaction region is formed by portions of three regions of the DNA polymerase. These three 
regions are located in (i) the O-helix, (ii) the K helix, and (iii) the inter O-P helical loop of Taq 
DNA polymerase or analogous positions in other DNA polymerases. DNA polymerases 
having reduced discrimination against fluorescently labeled nucleotides are particularly useful 
for chain termination DNA sequencing using 2*3' dideoxynucleotides, i.e., Sanger type 
sequencing. 

Enzyme kinetic experiments (described in examples 2 and 3) performed with Taq 
DNA polymerase and fluorescently labeled dideoxynucleotides support a theory that Taq 
DNA polymerase and other DNA polymerases, undergo a conformational shift upon the 
binding of nucleotides during DNA synthesis. This predicted conformational shift suggests 
a set of amino acid residues that interact with fluorescent labels joined by a linker to the 
nucleic acid base of a nucleotide, thereby resulting in discrimination against nucleotides that 
are fluorescently labeled. This set of amino acid residues forms the nucleotide label 
interaction region. The specific molecular model for the binding of fluorescently labeled 
nucleotide to a DNA polymerase proposed by the applicants is used to predict the amino acid 
residues that form the nucleotide label interaction region of a given DNA polymerase 
Applicants model for a conformational shift in DNA polymerase during DNA synthesis is 
offered as a explanation of how the nucleotide label interaction region was determined. The 
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model provides guidance in making mutations in DNA polymerase that reduce the ability of 
a DNA polymerase to discriminate against the incorporation of fluorescently labeled 
nucleotides into polynucleotides. Figure 1 is a computer model showing how DNA and Taq 
DNA polymerase interact in the model. Whether or not the true mechanism of DNA 
5 polymerase-nucleotide interaction is the same or different as the model used to determine the 

parameters of the nucleotide label interaction region is not determinative to the operability of 
the invention described herein. 

The mutant DNA polymerases of the invention exhibit reduced discrimination against 
nucleotides labeled with a fluorescein-type dye. In other words, the mutant DNA 

10 polymerases of the invention contain at least one mutation that increases the ability of the 

polymerase to incorporate a fluorescein-type dye labeled nucleotide relative to the 
corresponding unlabeled nucleotide. In addition to reduced discrimination against nucleotides 
labeled with fluorescein-type dyes, the mutant DNA polymerases of the invention may also 
exhibit reduced discrimination against nucleotides labeled with other fluorescent dyes that are 

15 not fluorescein-type dyes, as well as reduced discrimination against other detectable moieties. 

The fluorescently labeled nucleotides for which a given embodiment of the mutant DNA 
polymerases of the invention exhibit reduced discrimination may vary with respect to the 
particular fluorescent label, the linker used to attach the fluorescent label to the nucleotide, 
the site of attachment for the linker on the fluorescent label, the specific nucleotide base that 

20 is selected, and the site of attachment for the linker on the nucleotide. The precise degree of 

reduction in discrimination against a fluorescently labeled nucleotide will vary in accordance 
with the specific mutation or mutations introduced into the DNA polymerase. The precise 
degree of reduction in discrimination will also vary in accordance with the specific 
fluorescently labeled nucleotide assayed, e.g., variations in base, dye, or linker. Mutant DNA 

25 polymerase of the invention may exhibit anywhere from a slight reduction in discrimination 

against fluorescently labeled nucleotides to a complete elimination in discrimination, i.e., the 
mutant enzyme does not significantly differ with respect of rate of incorporation of labeled 
or unlabeled nucleotides. It is preferable to use embodiments of the subject mutant DNA 
polymerases that have at least a two-fold reduction in discrimination against one or more 

30 fluorescein type dye labeled nucleotides. 

It will be appreciated by persons skilled in the art of molecular biology that the 
nucleotide label interaction region of a given DNA polymerase is defined with respect to a 
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specific fluorescently labeled nucleotide. Changes in one or more of the following parameters 
of the structure of a fluorescently labeled nucleotide may alter the identity of the amino acid 
residues that form the nucleotide label interaction site of a given DNA polymerase: (1) 
identity of the base, (2) the site of attachment on the nucleotide base, (3) the identity of the 
5 linker joining the base to the florescent dye, and (4) the identity of the fluorescent dye. The 

nucleotide labeled interaction region of Taq defined with respect to TET(II)«ddCTP 
comprises the amino acid residues E520. A531, L522, R523, E524, A525, H526, P527, I 
528, V529, E530, K531, 1532, R536, E537, R573, Q582, N583, V586, R587, P589, Q592, 
R593, R595, D610, T612, Q613, E615, R636, D637, T640, F647, V654, D655, P656, L657, 
10 R659, R660, T664, E681, L682, A683, 1684, P685, E688, F692, Q754, H784, L817, E820, 

L828, K831, and E832. The sites at R660, T664, and E681 are of prefered sites for 
introducing mutations. Given that the 3-dimensional structure of Taq DNA polymerase (and 
other DNA polymerases) is well known and the three dimensional structure of 
TET(II)«ddCTP is understood with a high degree of certainty, the location of the amino acid 
15 residues that constitute the labeled nucleotide interaction region with respect to 

TET(II)»ddCTP may be translated to a different set of amino acid residues to accommodate 
structural differences between TET(II)»ddCTP and other fluorescently labeled nucleotides so 
as to define the labeled nucleotide interaction site with respect to those other nucleotides. For 
example, increasing the length of the linker between the base and the fluorescent label and the 
20 base may predictably alter the identity of amino acid residues that form the labeled nucleotide 

interaction site, even though the base, base attachment site, and fluorescent dye are the same. 
In many embodiments of the subject polymerases, the set of amino acid residues that form 
the labeled nucleotide interaction site with respect to a given fluorescently labeled nucleotide 
will overlap with the set of amino acid residues that form the labeled nucleotide interaction 
25 site as defined with respect to a second fluorescently labeled nucleotide. 

Embodiments of the invention include mutant DNA polymerases that exhibit reduced 
discrimination against nucleotides labeled with fluorescein-type dyes, wherein the fluorescein 
type dye is joined to the nucleotide base by an alkynylamino-type linker. The fluorescein-type 
dye may be a fluorescent energy transfer dye, comprising a fluorescein-type dye moiety as a 
30 component of the energy transfer dye. In addition to reduced discrimination against 

fluorescently labeled nucleotides comprising an alkynylamino-type linker, the mutant DNA 
polymerases of the invention may also exhibit reduced discrimination against nucleotides 
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comprising other types of linker. In order to minimize stearic interference between the 
polynucleotide and the fluorescent label, purines are usually labeled at position 7 and 
pyrimidines are usually labeled at position 5. 

Mutant DNA polymerases of the invention have one or more discrimination reducing 
mutations at amino acid residue positions within the nucleotide label interaction region of a 
given DNA polymerase. Discrimination reducing mutations are usually, although not 
necessarily, substitution mutations. Several different amino residues may be substituted at 
a given position of a parent enzymes so as to give rise to a discrimination reducing mutations. 
The amino acid residues at a given residue position within the nucleotide label interaction 
region may be systematically varied so as to determine which amino acid substitutions result 
in the reduction of discrimination against the fluorescein-type dye labeled nucleotide dye of 
interest and the degree of such a reduction in discrimination. The extent to which a particular 
mutation (or set of mutations) reduces discrimination may be measured by a selectivity assay 
as described in example 2, The substitution mutation is preferably, although not necessarily, 
a mutation that reduces the size of the amino acid residue side chain of the amino acid residue 
present in the parent DNA polymerase. Mutations are preferably, although not necessarily, 
conservative so as to maintain the specific polar or non-polar character of the amino acid 
residue at the analogous position parent molecule. The mutations in the nucleotide label 
interaction region of a DNA polymerase preferably result in the substitution of the amino acid 
residue of the parent enzyme with the amino acid residue at the corresponding position of 
phage T7 DNA polymerase (provided that a difference exists between the amino acid residues 
at that position in T7 polymerase and the parent enzyme). 

Discrimination reducing mutations are in the nucleotide label interaction region of 
DNA polymerases. The nucleotide label interaction region is formed by portions of three 
regions of the DNA polymerase. These three regions are located in (i)the O-helix, (ii) the K 
helix, and (Hi) the inter O-P helical loop of Taq DNA polymerase or analogous positions in 
other DNA polymerases. Positions in Taq DNA polymerase that form the nucleotide label 
interaction region are positions E520. A531, L522, R523, E524, A525, H526, P527, I 528, 
V529, E530, K531, 1532, R536, E537, R573, Q582, N583, V586, R587, P589, Q592, R593, 
R595, D610, T612, Q613, E615, R636, D637, T640, F647, V654, D655, P656, L657, R659, 
R660, T664, E681, L682, A683, 1684, P685, E688, F692, Q754, H784, L817, E820, L828, 
K83 1, and E832. Analogous positions in DNA polymerases other than Taq are also form 
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a nucleotide label interaction region. Preferred positions for substitution mutations are R595, 
D655, R660, and E681. A particularly preferred position for mutations is E681, with the 
preferred substitution at position 681 being M. Other suitable substitution mutations at E681 
are as follows (listed in order of decreasing preference, excpt where note by a equal sign to 
5 denote approximate equivalence"): M>I>W>L>V>P>H=K=G=T=S>D=A=N>Y-C. A 

preferred substitution mutation at position R660 is R660D. 

The specific amino acid residues that form the nucleotide interaction region will vary 
in accordance with the particular DNA polymerase selected as a parent enzyme for the 
introduction of discrimination reducing mutations. The determination of analogous amino 

10 acid residues positions between different DNA polymerases may easily be achieved by the 

person skilled in the art because of the large number of DNA polymerase amino acid 
sequences that have been determined and the many regions of homology have been found 
between these different DNA polymerases. For example, a large compilation of the amino 
acid sequences of DNA polymerases from a wide range of organism and homology alignments 

1 5 between the sequences can be found in Braithwaite and Ito, Nucl. Acids Res. 2 1 (4): 787-802 

(1993). Examples of amino acid residues within the nucleotide label interaction regions of 
phage T7 polymerase and E. coli DNA polymerase are provided in Table 1 . In addition to 
providing mutant DNA polymerases having reduced discrimination for fluorescein type dyes 
in Taq, T7 and E, coli DNA polymerase I, the invention provides mutant DNA polymerases 

20 from many other organisms. In general, the teachings of the invention may used to produce 

mutant DNA polymerases having reduced discrimination for fluorescein type dyes from any 
DNA polymerase that shares sufficient amino acid sequence homology to Taq DNA 
polymerase to permit a person of ordinary skill in the art to identify one or more amino acid 
residue positions in the DNA polymerase that are analogous to positions E520. A53 1, L522, 

25 R523, E524, A525, H526, P527, 1 528, V529, E530, K531, 1532, E537, R573, V586, R587, 

P589, Q592, R593, R595, D610, T612, Q613, E615, R636, T640, F647, V654, D655, P656, 
L657, R659, R660, T664, E681, L682, A683, 1684, P685, E688, F692, Q754, L817, E820, 
L828, K831, and E832 in Taq DNA polymerase. Parent DNA polymerases that may be 
modified to contain discrimination reducing mutations in the nucleotide label interaction 

30 region include, but are not limited to, DNA polymerases from organisms such as Thermus 

flavus, Pyrococcus furiosus, Thermotoga neapolitana, Thermococcus litoralis, Sulfolobus 
solfataricus, Thermatoga maritima, E. coli phage T5, and E. coli phage T. The DNA 
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polymerases may be thermostable or not thermostable. It will be appreciated that the present 
invention enables persons skilled in the art to introduce fluorescein-type dye discrimination 
reducing mutations in to DNA polymerases from a wide variety of organisms, including DNA 
polymerases that have not been isolated at the time of the filing of this application provided. 
5 Additionally, embodiments of the invention includes some purified naturally-occurring DNA 

polymerases that have the desired low degree of discrimination against fluorescently labeled 
nucleotides. Such naturally-occurring DNA polymerases are structurally and functionally 
analogous to the mutant DNA polymerases explicitly described herein. 

The amino acid residues that constitute the nucleotide label interaction region of a 

10 given DNA polymerase vary in accordance with the specific fluorescently labeled nucleotide 

that is used to define the nucleotide label interaction region. Similarly, the mutations that are 
discrimination reducing mutations may vary in accordance with the specific fluorescently 
labeled nucleotide that is used to define the labeled nucleotide interaction region. 
Additionally, the degree of discrimination reduction achieved by the mutation (or mutations) 

15 in the labeled nucleotide interaction site may vary with the specific labeled nucleotide of 

interest. For example, E681M is the preferred discrimination reducing mutation in Taq with 
respect to TET(II)»ddCTP resulting in a 47x reduction in discrimination and a significantly 
lower reduction in discrimination against a second fluorescently labeled nucleotide. 
Conversely, an E681T mutation may result in a high level reduction in discrimination against 

20 the second fluorescently labeled nucleotide and only a low level of reduction in discrimination 

against TET(II)-ddCTP. 

Given that a mutant DNA polymerase of the invention may have discrimination 
reducing mutation in the nucleotide label interaction region resulting in a significant degree 
of reduction in discrimination for a specific fluorescently labeled nucleotide and little or no 

25 reduction in the degree of reduction of discrimination against another fluorescently labeled 

nucleotide (assuming there is significant discrimination against that fluorescently labeled 
nucleotide by the parent DNA polymerase), a given mutant DNA polymerase may be said to 
be "receptive" with respect to one or more given fluorescently labeled nucleotide. A specific 
mutant DNA polymerase is referred to as "receptive" with respect to a specific fluorescently 

30 labeled nucleotide if a discrimination reducing mutation in the nucleotide label interaction site 

in the specific enzyme of interest results in at least a five fold reduction in discrimination 
against that given fluorescently labeled nucleotide. A mutant DNA polymerase of the 
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invention may be receptive with respect to more than one fluorescently labeled nucleotide. 
Conversely, a specific fluorescently labeled nucleotide may be "receptive" with respect to a 
given mutant DNA polymerase of the invention. 

In embodiments of the subject mutant DNA polymerases comprising more than one 
discrimination reducing mutation in the nucleotide label interaction region, the mutation site 
may be in the same or different region of the three regions of a polymerase that form the 
nucleotide label interaction region. In general, mutant DNA polymerases of the invention will 
have 1 , 2, or 3 discrimination reducing mutations. However, the invention also provides 
mutant DNA polymerases having more than 3 discrimination reducing mutations. By 
combining multiple discrimination reducing mutations, greater levels of reduction in labeled 
nucleotide discrimination may be achieved. However, in many embodiments of the invention, 
mutant DNA polymerases have levels of reduced labeled nucleotide discrimination that are 
the same or less than the levels of DNA polymerase with single discrimination reduction 
mutations in the nucleotide label interaction region. Preferred combinations of mutations in 
a Taq DNA polymerase background are R660D, E681G, and F667Y, i.e., Taq DNA 
polymerase mutant (R660D, E681G, and F667Y). 

Different embodiments of DNA polymerase having mutations in the nucleotide label 
interaction region differ with respect to the degree of reduction in discrimination against 
specific fluorescently labeled nucleotides. These differences may be measured by an assay 
in order to determine which specific embodiments have the greatest degree of reduction in 
discrimination against the particular fluorescently labeled nucleotides of interest. Generally, 
such assays measure competition between a fluorescently labeled nucleotide and an unlabeled 
nucleotide for incorporation into the same site on a primed template. One example of such 
an assay (referred to herein as a "selectivity assay") is described in detail below in Example 
2. 

The mutant DNA polymerases of the invention may comprise numerous mutations in 
addition to discrimination reduction mutations in the nucleotide label interaction region. 
These secondary mutations may be either inside or outside the nucleotide label interaction 
region. Secondary mutations may be selected so as to have as to confer some useful property 
on the mutant DNA polymerase. For example, additional mutations may be introduced to 
increase thermostability; decrease thermostability, increase processivity, decrease processivity, 
decrease 3*-5' exonuclease activity, increase 3-5' exonuclease activity, decrease 5-3' 
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exonuclease activity, increase 5-3' exonuclease activity, and increase incorporation of 
dideoxynucleotides. Alternatively, the secondary mutations may be essentially neutral in 
known effect. 

Of particular interest are embodiments of the subject mutant DNA polymerase that 
5 comprise one or more secondary mutation that reduce 3-5' exonuclease activity. DNA 

polymerases that are deficient in 3-5' exonuclease activity have superior properties for PCR 
and for chain termination polynucleotide sequencing. Mutations that reduce 3-5' 
exonuclease activity in DNA polymerase are well known to person of ordinary skill in the art. 
Detailed guidance on how to introduce mutations that reduce 3*-5' exonuclease activity can 

10 be found, among other places in U.S. Patent No. 4,795,699 (Tabor); U.S. Patent No. 

5,541,099; US. Patent No. 5,489,523; and Bemad et al., CeU 59:219-288 (1989). Examples 
of such mutations in Taq DNA polymerase include G46D. For embodiments of the mutant 
DNA polymerases that are used for sequencing, it is preferable to include a G46D (or 
analogous mutations in DNA polymerases other than Taq) in addition to mutations in the 

15 nucleotide label interaction region. 

Also of interest among secondary mutations in the subject DNA polymerase mutants 
are mutations that increase incorporation of dideoxynucleotides, i.e., reduce the ability of 
a DNA polymerase to discriminate against dideoxynucleotide as opposed to 
deoxynucleotides. Guidance on making such mutations can be found, among other places in 

20 published PCT application WO96/12042 (application number PCT/US95/12928). Of 

particular interest is the mutation F667Y in Taq and analogous mutations in other DNA 
polymerase. While F667Y is not part of the nucleotide label interaction region in Taq DNA 
polymerase with respect to Tet(II>ddLTP, F667Y mutations may reduce discrimination 
against fluorescein-type dye labeled nucleotides (see Table 1 ). Accordingly, for use in certain 

25 procedures, e.g., DNA sequencing, be desirable to combine an F667Y mutations with one or 

more discrimination reducing mutations in the nucleotide label interaction region so as to 
reduce discrimination of the polymerase between deoxynucleotides and 2'3* 
dideoxynucleotides. Mutant DNA polymerase of the invention having the F667Y mutation 
(or equivalent thereof) are particularly useful in Sanger type DNA sequencing with 

30 fluorescently labeled 2*3' dideoxynucleotide chain terminators. 

Numerous genes encoding DNA polymerases have been isolated and sequenced. This 
sequence information is available on publicly accessible DNA sequence databases such as 
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GENBANK. A large compilation of the amino acid sequences of DNA polymerases from a 
wide range of organism can be found in Braithwaite and Ito, Nucl. Acids Res. 21(4):787-802 
(1993). This information may be used in designing various embodiments of DNA 
polymerases of the invention and polynucleotide encoding these enzymes. The publicly 
5 available sequence information may also be used to clone genes encoding DNA polymerases 

through techniques such as genetic library screening with hybridization probes. 

Other embodiments of the invention are polynucleotide sequences encoding the 
mutant DNA polymerases provided herein. Polynucleotide sequences encoding the mutant 
DNA polymerase of the invention may be used for the recombinant production of the mutant 

10 DNA polymerases. Polynucleotide sequences encoding mutant DNA polymerases having 

reduced discrimination against fluorescently labeled nucleotide may be produced by a variety 
of methods. A preferred method of producing polynucleotide sequences encoding mutant 
DNA polymerases having reduced discrimination against fluorescently labeled nucleotides is 
by using site-directed mutagenesis to introduce desired discrimination reducing mutations into 

15 polynucleotides encoding the parent DNA polymerase molecules. Site-directed mutagenesis 

techniques are well known in the art as exemplified by U.S. Patent No. 4,71 1,848; U.S. Patent 
No. 4,873,192; U.S. Patent No. 5,071,743; U.S. patent, 5,284,760; U.S. Patent No. 
5,354,670; U.S. Patent No. 5,556,747; Zoller and Smith, Nucleic Acids Res. 10:6487-6500 
(1982), and Edelman et al DNA 2:183 (1983). Detailed protocols for site-directed 

20 mutagenesis are also given many general molecular biology textbooks such as Sambrook et 

al Molecular Cloning a Laboratory Manual 2nd Ed. Cold Spring Harbor Press, Cold Spring 
Harbor (1989), Ausubel et al. Current Protocols in Molecular Biology, (current edition). 
Additionally, many text books on PCR (the polymerase chain reaction), such as Diefenbach 
and Dveksler, PCR Primer: A Laboratory Manual . Cold Spring Harbor Press, Cold Spring 

25 Harbor, NY (1995), describe methods of using PCR to introduce directed mutations. Genes 

encoding parent DNA polymerase may be isolated using conventional cloning techniques in 
conjunction with publicly-available sequence information. Alternatively, many cloned 
polynucleotide sequences encoding DNA polymerases have been deposited with publicly- 
accessible collection sites, e.g., the American type culture collection deposit accession number 

30 ATCC 40336 is a phage clone of Taq DNA polymerase. 

In addition to producing the mutant DNA polymerase encoding polynucleotides of the 
invention by introducing directed mutations into polynucleotides encoding parent DNA 
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polymerases, it is possible (although difficult) to produce the polynucleotides of the invention 
primarily by in vitro DNA synthesis techniques. In vitro DNA synthesis techniques are well 
known to those skilled in the art and examples of in vitro DNA synthesis can be found in U.S. 
Patent No. 5,252,530; U.S. Patent No. 4,973,679; U.S. Patent No. 5,153,319; U.S. Patent 
5 No. 4,668,777; U.S. Patent No. 4,500,707; U.S. Patent No. 5,132,418; U.S. Patent No. 

4,415,732; U.S. Patent No. 4,458,066; and U.S. Patent No. 4,811,218. When producing 
relative polynucleotide molecules by in vitro DNA synthesis, smaller molecules are usually 
produced first and subsequently joined together by hybridization and ligation. Mutant DNA 
polymerase encoding polynucleotides may also be produced by a combination of in vitro 

10 synthesis and site-directed mutagenesis of cloned genes. 

Polynucleotide encoding the mutant DNA polymerase of the invention may be used 
for the recombinant expression of the mutant DNA polymerases. Generally, the recombinant 
expression of the mutant DNA polymerase is effected by introducing a mutant DNA 
polymerase into an expression vector adapted for use in particular type of host cell. Thus, 

15 another aspect of the invention is to provide expression vectors comprising a polynucleotide 

encoding a mutant DNA polymerase of the invention, such that the polymerase encoding 
polynucleotide is functionally inserted int the expression vector. The invention also provide 
host cells comprising the expression vectors of the invention. Host cells for recombinant 
expression may be prokaryotic or eukaryotic. Example of host cells include bacterial cells, 

20 yeast cells, cultured insect cell lines, and cultured mammalian cells lines. Preferably, the 

recombinant host cell system is selected so as to closely match the organism from which the 
mutant DNA polymerase was derived. For example, prokaryotic DNA polymerases are 
preferably expressed in a prokaryotic expression system. A wide range of expression vectors 
are well known in the art. Description of various expression vectors and how to use them 

25 can be found among other places in U.S. Patent No. 56041 18; U.S. 5,583,023, U.S. Patent 

No. 5,432,082; U.S. Patent No. 5,266,490; U.S. Patent No. 5,063,158; U.S. Patent No. 
4,966,841; U.S. Patent No. 4,806,472; U.S. Patent No. 4,801,537; and Goedel et al., Gene 
Expression Technology, Methods of Enzvmology . Vol. 185, Academic Press, San Diego 
(1989). The expression of DNA polymerases in recombinant cell systems is a well-established 

30 technique. Examples of the recombinant expression of DNA polymerase can be found in U.S. 

Patent No. 5,602,756; U.S. Patent No. 5,545,552; U.S. Patent No. 5,541,31 1; U.S. Statutory 
Inventor Registration H 1,531; U.S. Patent No. 5,500,363; U.S. Patent No. 5,489,523; U.S. 
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Patent No. 5,455,170; U.S. Patent No. 5,352,778; U.S. Patent No. 5,322,785; and U.S. 
Patent No. 4,935,361. 

Other embodiments of the invention include multiple DNA polymerase compositions 
particularly useful for polynucleotide sequencing, such compositions comprise at least two 
5 different mutant DNA polymerases of the invention, wherein (1) the first mutant DNA 

polymerase is receptive with respect to a first fluorescently labeled nucleotide; (2) the second 
mutant DNA polymerase is receptive with respect to a second fluorescently labeled 
nucleotide; and (3) the first and second fluorescently labeled nucleotides differ from one 
another with respect to their nucleotide bases and fluorescent labels. The first and second 

10 fluorescently labeled bases may also differ with respect to one another by way of the linker, 

the base attachment position, or the fluorescent dye attachment site. The subject 
compositions are useful for catalyzing the sequencing reactions in Sanger type DNA 
sequencing with fluorescent dye labeled 2'3* dideoxy chain terminating nucleotides. Chain 
termination sequencing with fluorescently labeled terminators preferably employs at least two, 

1 5 and more preferably 4 different fluorescently labeled chain terminators, wherein each different 

base is labeled with a distinctive fluorescent label. Because of the necessary structural 
differences between the different fluorescently labeled chain terminators required for a 
sequencing reactions, i.e., nucleotide bases and fluorescent labels, there are many mutant 
DNA polymerases of the invention that are not receptive to all of the fluorescently labeled 

20 terminators necessary for a given sequencing reaction. Thus, there are embodiments of the 

subject DNA polymerases that may have undesirably high levels of discrimination against one 
or more of the labeled terminators used in a sequencing reaction set The subject 
compositions of two or more mutant polymerases ameliorates this problem by simultaneously 
employing multiple mutant DNA polymerases that are receptive to different chain labeled 

25 terminators, thereby having at least one of the mutant polymerases "compensate" for the 

discrimination against a particular fluorescently labeled terminator by the other polymerases 
catalyzing the sequencing reactions. The ratio of the different DNA polymerases in the 
composition preferably are selected so as to result in approximately equal levels of total 
activity for each of the different mutant DNA polymerases. Differences in specific activity 

30 between the different mutant polymerases may be taken into account when equalizing total 

activity ratios between the polymerases. Differences in activity levels between the various 
mutant DNA polymerases in the subject compositions may also be compensated for by 
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adjusting the levels of the different fluorescently labeled terminators in the subject 
compositions. The subject multiple polymerase compositions may comprise two, three, four, 
or more different mutant DNA polymerases. The mutant polymerase may or may not be 
derived from the same species or strain. The different mutation DNA polymerases in the 
5 subject mutant polymerase compositions may or may not be receptive for one or more of the 

fluorescently labeled nucleotides in a given set fluorescently labeled dideoxynucleotides for 
sequencing. 

The invention also includes various methods of using the mutant DNA polymerases 
(or subject multiple mutant DNA polymerase compositions) of the invention. The mutant 

10 DNA polymerases of the invention may be substituted for the corresponding parent DNA 

polymerases in most procedures that employ DNA polymerases. In order to more fully take 
advantage of the properties of the subject mutant DNA polymerases, the amount (or 
concentration) of labeled and unlabeled nucleotides used in the methods of the invention may 
be changed with respect to the amounts (or concentrations) used in the corresponding 

15 methods employing convention DNA polymerases. These changes in the amount of 

nucleotide may be optimized by routine experimentation. Methods of the invention comprise 
the step of extending a primed polynucleotide template with at least one fluorescently labeled 
nucleotide, wherein the extension is catalyzed by a mutant DNA polymerase of the invention. 
Thus, the subject methods result in the formation of one or more different fluorescently 

20 labeled polynucleotides produced by primer extension. The subject methods of synthesizing 

a fluorescently labeled polynucleotide may be used in a variety of procedures including, but 
not limited to, Sanger sequencing (e.g., dideoxy nucleotide chain termination), the polymerase 
chain reaction (PCR), polynucleotide labeling, minisequencing. The reduced discrimination 
against fluorescently labeled nucleotide properties of the subject mutant DNA polymerase is 

25 particularly useful for Sanger DNA sequencing reactions, including cycle sequencing. The 

use of the subject mutant DNA polymerases for Sanger sequencing reduces the amount of 
fluorescently labeled chain terminating nucleotides required for a sequencing reaction an may 
in many case be used to increase the number of bases that may be identified in single 
sequencing reaction that is analyzed on an automated fluorescence-based sequencing 

30 apparatus such as an Applied Biosystems 3 10 or 3 77( Applied Biosystems Division of Perkin- 

Elmer, Foster City, CA). Detailed protocols for Sanger sequencing are known to those 
skilled in the art and may be found, for example in Sambrook et al, Molecular Cloning, A 
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Laboratory Manual, Second Edition, Cold Spring Harbor Press, Cold Spring Harbor, NY 
(1989). 

The invention also provides kits for synthesizing fluorescently labeled polynucleotides. 
The kits may be adapted for performing specific polynucleotide synthesis procedures such as 
DNA sequencing or PCR. Kits of the invention comprise a mutant DNA polymerase of the 
invention and a fluorescently labeled nucleotide that exhibits reduced discrimination with 
respect to the mutant DNA polymerase in the kit. Kits preferably contain detailed instructions 
on how to perform the procedures for which the kits are adapted. Optionally, the subject kit 
may further comprise at least one other reagent required for performing the method the kit 
is adapted to perform. Examples of such additional reagents include unlabeled nucleotides, 
buffers, cloning vectors, restriction endonucleases, sequencing primers, and amplification 
primers. The reagents include in the kits of the invention may be supplied in premeasured 
units so as to provide for greater precision and accuracy. 

Other embodiments of the invention include kits comprising (1) the subject 
compositions of multiple mutant DNA polymerases, and (2) fluorescently labeled chain 
terminating nucleotides suitable for use with the subject compositions, i.e., each labeled chain 
terminator is receptive with respect to at least one of the mutant DNA polymerases in the 
composition. Additional embodiments of the invention include kits for sequencing DNA that 
comprise a multiple mutant polymerase composition of the invention and at least two different 
fluorescently labeled chain terminating nucleotides are labeled at different bases, wherein each 
of the fluorescently labeled chain terminating nucleotides is receptive with respect to at least 
one mutant DNA polymerase in the composition. 

The invention, having been described above, may be better understood by reference 
to the following examples. The examples are offered, for among other reasons, to illustrate 
specific embodiment of the invention and should not be construed as a limitation on the 
invention. 

EXAMPLES 

Example 1 

Purification of Mutant Forms of Tag DNA Polymerase 

Lysates ofE. coli containing recombinant constructs designed for the productiion of 
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recombinant mutant Taq DNA polymerases were made essentially as described in tDesai, UJ. 
and Pfaffle, P.K., Biotechniques, 19:780-784 (1995). In order to prevent the polymerase 
from binding to chromosomal and plasmid DNAs contaminating the lysate, 5 M NaCl was 
added dropwise to the heat treated, clarified lysates to bring the final NaCl concentration to 
5 0.25 M. DNA was then precipitated from this mixture by dropwise addition of 5% 

polyethylimine (in 20 mM TRIS»C1, pH 8.5) to make the final concentration of PEI 0.3%. 
Precipitation was allowed to continue for 5 minutes on ice. A white, cloudy precipitate was 
removed by centrifiigation at 15,000 x g for 15 minutes at 4°C. The supernatant fluid was 
decanted and saved. Following centrifugation, the NaCl concentration was reduced to 0.13 

10 M by monitoring conductivity of the solution during the addition of TETT minus NaCl (20 

mM TRIS-C1, 0.1 mM EDTA, 0.05% Tween-20, 0.05% Triton-XlOO, 1% glycerol, pH 8.5). 

Excess PEI was removed using a Bio-Rex 70 (BIO-RAD, Richmond, CA) column 
(2.5 x 30 cm). The column was poured and equilibrated with TETT Buffer + 0.1 M NaCl. 
The polymerase does not bind to the Bio-Rex 70 under these conditions. 

15 To remove contaminating E. coli proteins, the Bio-Rex 70 column eluate was loaded 

directly onto a Heparin-Agarose (Sigma Chemical Company, St. Louis, MO) column (1.5 x 
30 cm) which was also poured and equilibrated in TETT Buffer + 0.1M NaCl. The heparin- 
agarose column was washed with 2 column volumes of TETT + 0.1 M NaCl and Taq DNA 
polymerase was eluted as a sharp peak using TETT + 1 M NaC! Elution was monitored at 

20 280 nm. 

The heparin-agarose column fractions corresponding to the peak absorbance were 
pooled and concentrated to 0.15 ml using Ultrafree-15 Centrifugal Filter Devices (Millipore 
Corporation, MA) according to the manufacture's recommendations for centrifugation speeds 
and times. The concentrate was diluted to 1 5 ml with TETT Buffer + 5% glycerol and the 
25 sample was re-concentrated to 0. 1 5 ml. This was repeated one more time to reduce the final 

NaCl concentration below 1 mM in the protein samples. 

The concentrated polymerase samples were diluted two-fold using TETT + 5% 
glycerol and an equal volume of TETT + 95% glycerol was added to bring the final glycerol 
concentration to about 50%. Samples were stored at -20 °C. Protein concentrations were 
30 determined using the "Bradford Protein Assay" (BIO-RAD, Richmond, CA). Activity was 

measured using a radiometric assay (described elsewhere) 

Typical yields of polymerase from 2-liters of induced E. coli culture (corresponding 
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to 30- 50 ml of heat treated, clarified lysate) ranged from 4 to 24 mg. SDS-PAGE analysis 
of the purified samples showed one dark band of about 94,000 molecular weight) and several 
minor ones after Coomassie Blue staining. The gels indicated a typical purification level of 

> 90%. 

5 

Example 2 
Selectivity Assay 

An unlabeled versus dye-labeled terminator assay ("terminator" is defined as a non- 
extendible base such as 2\3- ddNTPs) was used to screen mutant Taq DNA polymerase 
1 0 samples for better Tet(II)»ddCTP incorporating mutant forms of this polymerase. This assay 

is based upon two substrates competing for the same active site at the same time during a 
steady state reaction in which only the polymerase concentration is limiting. Therefore, the 
assay measure the polymerase's "selectivity" for the unlabeled versus the fluorescein-labeled 
terminator. The DNA Primer/template used in this assay format is given below: 

15 

5'->(FAM)-CCC TCG CAG CCG TCC AAC CAA CTC A 

GGG AGC GTC GGC AGG TTG GTT GAG TGC CTC TTG TTT<-5' 
The next template position following the 3'-end of the primer is indicated above by the 
bold and underlined G 

20 

The reaction consisted of: 

80 mM TRIS-C1 (pH 9.0 at 20 °C) 
1000 nM DNA primer/template [ 5*-(FAM)25mer / 36 G, template ] 
2 mM MgCU 
25 5 0 nM TET(II)-ddCTP 

1 jiM ddCTP 
0.25 Units of enzyme 
40 nL reaction volume 
60 °C reaction temperature 

30 

Samples (2 |iL) were removed from the reaction mixture at pre-determined times 
(typically, 20 second intervals for 0.25 Units of polymerase activity per |iL) and added to ice 
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cold 50 fiL 0.5 M EDTA (pH 8.0). Timed aliquots were mixed and held on ice for further 
processing. 

Samples of each time point were processed to remove excess, unincorporated 
TET(II>ddCTP. Typically, 1 .6 |iL of each quenched sample were added to 250 of 0.8 
M LiCl plus 0.2 ^ig/ml R coli tRNA. followed by 750 |iL of 95% ethanol. After mixing, the 
nucleic acids were allowed to precipitate for 20 minutes at -20 °C. The precipitates were 
recovered by centrifugation using standard procedures. The supernatant fluid was discarded 
and pellets were dissolved in 50 |iL of 50% formamide. Gel samples were heat treated (95 °C 
for 2 minutes) and 2 \xL were loaded per sample lane on a 1 6% denaturing DNA sequencing 
gel. Gels were run on an Applied Biosystems Model 373 Sequencer using GeneScan 
Fragment Analysis software to measure the amount of FAM fluorescence in the bands 
corresponding to the 25-mer primer, the 26-mer product (indicating a ddC incorporation 
event) and the apparent " 27-mer" product band (indicating a TET(II>ddC incorporation 
event). 

The fluorescence signal in each of the bands was summed and the percent of signal 
in each band was used for further calculations as a normalization to avoid lane to lane loading 
differences. Energy transfer from the %-FAM moiety present on the apparent "27-mer" 
product molecules to the Tet(II) moiety on the newly incorporated 3'-base was not corrected 
since all ratios were compared to "wild type" or Taq G46D.) The normalized fluorescent 
signals in the 26-mer and "apparent" 27-mer product bands were corrected for the different 
concentrations of the two molecules used in the reaction and the corrected values were 
plotted versus time. The velocity of incorporation for each substrate was determined using 
least square fits to the data. The ratio of ddC / TET(II)»ddC incorporation rates is equal to 
the selectivity bias that the sample polymerase shows for the unlabeled versus the TET(II)- 
labeled nucleotides and reflects the following relationship: 



V ddC :v TcfddC 



= (K., / *m [ ddC ]-*■ (H at / ^..ddc [ Tet(II).ddC 



where: 



V Tct(n)-ddC 



ddC 



velocity of ddC incorporation 
velocity of Tet(II)«ddC incorporation 



'cat 



catalytic rate constant 
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K M = nucleotide equilibrium binding constant 

[ ddC ] = concentration of ddCTP in the reaction 

[ Tet(II>ddC ] = concentration of Tet(II)¥ddCTP in the reaction 

5 In this assay format, "wild-type" Taq or (Taq G46D) showed a selectivity bias or ddC 

/ Tet(II>ddC number of about 85 to 1 . Mutants showing lower selectivity bias ratios were 
submitted to further testing. The Table 2 below shows the results for a few of the mutants 
tested by way of a few examples: 

10 Table 2 



Taq 


Selectivity Number 


WT / Mutant 


G46D 


85 


85/85 or 1 


G46D; R660D 


8 


85/8 or » 10 


G46D; R595E 


28 


85/28 or * 3 


G46D; F667Y 


28 


85/28 or « 3 


G46D; E681G 


40 


85/40 or » 2 


G46D; D655L 


40 


85/40 or « 2 



20 Example 3 

Next Nucleotide Rate Effect Assay 

An additional kinetic step between "ground state" nucleotide binding or initial collision 
and correct base pair formation and the group transfer reaction would be expected to slow 
the polymerase dissociation rate from an Enz»DNA complex having a 3-dideoxynucleotide 

25 in an assay termed the "Next Nucleotide Rate Effect" (Patel et al., 1991), This assay 

measures the steady state rate of incorporation of ddTTP (i.e., the enzyme is limiting) in the 
absence or presence of the next correct nucleotide. The primer template pair is shown below: 

5'->(FAM)-CCC TCG CAG CCG TCC AAC CAA CTC A 
30 GGG AGC GTC GGC AGG TTG GTT GAG TAG GTC TTG TTT<-5' 

The next template position is indicated by the bold, underlined A. The next template 
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position beyond A is G. Under steady state reaction conditions, essentially all of the available 
polymerase is bound to the primer/template. When ddTTP is present alone in solution, it is 
incorporated following binding to its template position, A. Additional incorporation events 
require the polymerase to dissociate from the Enz-DNA complex and find another available 
5 primer/template that has not already undergone and incorporation event. Hence the rate of 

incorporation under these conditions is the dissociation rate of the polymerase from the 
Enz^DNA complex. If the next correct nucleotide, dGTP or ddCTP, is also present in the 
reaction mixture, the dissociation rate of the polymerase from the Enz»DNA*ddCTP complex, 
for example, will be slower if there is an additional kinetic step between the group transfer 
10 reaction that incorporated the ddTTP and an attempt by the polymerase to incorporate ddCTP 

in a processive mode of synthesis. This slower rate of dissociation an be detected as a slower 
incorporation rate of ddTTP since no chemistry can occur once ddTTP and the polymerase 
can no be processive despite the presence of another correct nucleotide. As shown in Figure 
2, the presence of the next correct nucleotide does indeed slow the turnover or dissociation 
15 rate of the polymerase (Taq G46D; F667Y). Figure 2 also shows that the presence of a 

fluorescein dye on the next correct nucleotide (in this case, Tet(II>ddCTP), appears to 
accelerate the turnover rate. We interpret this to mean that the polymerase is constantly 
undergoing a conformational change and that it can attempt to undergo the change even in 
the absence of the next correct nucleotide. However, the presence of a fluorescein dye on the 
20 next correct nucleotide blocks the ability of the polymerase to undergo such a change and 

thereby causes an immediate dissociation of the enzyme following the group transfer step for 
ddTTP incorporation. Hence, the fluorescein dye appears to accelerate the polymerase 
dissociation rate by eliminating a kinetic step (or steps) following the group transfer reaction. 
Figure 3 shows the results for a Next Nucleotide Rate Effect assay for a "multiple" 
25 mutant form of Taq DNA polymerase, Taq G46D; R660D; F667Y; E681G. In this case, the 

presence of Tet(II) on the next correct nucleotide is "transparent" to the mutant polymerase. 
We interpret this to mean that the mutant polymerase can indeed undergo the same kinetic 
steps following group transfer that "wild-type" versions of this polymerase undergo. We also 
interpret these results to indicate that the F667Y mutation belongs in a different class than the 
30 R660D or E681G mutations since Taq G46D; F667Y still shows a "fluorescein-effect" in the 

"Next Nucleotide Rate Effect" assay, however, the multiple mutant, Taq G46D; R660D, 
F667Y; E681G, does not. 
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Typical assay conditions for the Next Nucleotide Effect assay were as follows: 



1 000 nM primer/template DN A 

80 mM TRIS-C1 (pH 9.0 @ 20° C) 
5 2.4mMMgCl 2 

0.02 Units/|iL polymerase activity 
400 (iM each nucleotide (when present) 

Samples were taken and processed in the same manner as described under "Selectivity 
10 Assay." In this case, it is possible to distinguish a ddC-incorporation event from a 

Tet(II>ddC incorporation event by the migration rate of the resulting fragments in a 16% gel. 
Incorporation of ddC results in a "normal" 26-mer band that migrates as expected above or 
slower than the 25-mer primer. Incorporation of Tet(II>ddC results in slower migration 
causing the band to migrate with an apparent size equivalent to a 27- or 28-mer. 

15 

Example 4 

Analysis of Additional Mutants 

Table 1, provided below, provides a summary of results obtained with selectivity 
assays performed with several different Taq mutants. The analogous site for the mutation in 
20 the enzymes R coli DNA polymerase I and phage T7 DNA polymerase are also noted. The 

term "FS" refers to a Taq DNA polymerase having a F667Y mutation. 
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Incorporation by Reference 

This application incorporates all publications, patents, and patent application 
referenced herein in there entirety. 

Equivalents 

While the invention has been described and illustrated with reference to specific 
embodiments, those skilled in the art will recognize that modifications and variations may be 
made without departing from the principles of the invention as described hereinabove and set 
forth in the following claims. 
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R660- Mutants 

Lvsate # Genotvpe 



Acidic- 

Aspartic acid 



29 
38 
39 
40 
41 
49 



CS; R660D 
FS;R660D 
R595E; FS; R660D 
D655L; FS; R660D 
FS;R660D;E681G 
CS; R660D 



Specific Activity 



14 
9 
0 
12 
31 
41 



Tet Selectivity 
TET(II)«ddCTP / ddCTP * 
(Mutant / WT) 



10* 

10 

nd 

10 

12 

nd 



Glutamic Acid 5 1 
72 



Basic- 
Lysine 

Histidine 

Imino- 
Proline 

Aliahatic- 
Alanine 

Isoleucine 

Valine 



Leucine 
Glycine 



50 
101 

66 

68 

73 

90 
55 

91 
52 

47 
78 



Polar Uncharged- 
Glutamine 53 
69 

98 

93 

97 

96 



Serine 

Cysteine 

Asparagine 

Threonine 

Methionine 

Aromatic 
Phenyalanine 

Tyrosine 



92 
95 



FS;R660E; E681G 
FS; R660E 



FS; R660K 
FS; R660H 

FS; R660P 

FS; R660A 

FS; R660I 

FS; R660V 
FS;R660V;E681G 

FS; G660L 
FS;R660L; E681G 

FS; R660G; E681G 
FS; R660G 

CS; R660Q 
FS; R660Q 

FS; R660S 

FS; R660C 

FS; R660N 

FS; R660T 



FS; R660F 
FS; R660Y 



11 

1 



28 
13 



4 
5 

10 

1 

8 
28 

18 
8 

47 
5 

16 

14 

13 

26 



9 
17 



7 
.7 



1** 

1 

1 

4 

1 
1 

0.6*** 

1 

6 
2 

1 
3 

7 

4 

3 

3 



0 2*** 

1 



Table 2 
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E681- Mutant 

Lvsate # Genotype Specific Activity Tet Selectivity 

TET(ID-ddCTP / ddCTP * 



Acidic- 

Aspartic acid 


71 


FS;E681D 


9 


■ (Muiant / Wl 
4* 


Basic- 
Lysine 


/j 


r5; -COO IK 


52 


6 


Arginine 










His d dine 


SO 


FS; E681H 


37 


7 


lmmo- 

Proline 


74 


FS;E68lP 


19 


9 


AHohatic- 
Alanine 


63 


FS;E68lA 


13 


•6 


Isoleucine 


99 


FS;E68lI 


37 


27 


Valine 


76 


FS;E68lV 


110 


10 


Leucine 


87 


FS;E681L 


22 


14 


Glycine 


48 


FS;E681G 


37 


6 


Polar Tlncharsxed- 
Glutarnine 








Serine 


61 


FS;E681S 


12 


5 


Cysteine 


88 


FS;E681C 


20 


2 


Asparagine 


89 


FS;E681N 


40 


A 


Threonine 


81 


FS;E68lT 


35 


6 


Methionine 


85 


FS;E681M 


32 


47 


Aromatic- 
Phenyalanine 










Tyrosine 


80 


FS;E681Y 


42 


3 


Tryptophan 


84 


FS;E68lW 


37 


17 


11-05-97 











*Ratio > 1 means improved TET(II)*ddCTP incorporation, 
enzyme. 

**Ratio = 1 means wild-type activity. 
***Ratio < i means activity worse than wild-type. 

M>I>W>L>V>P> H=K=G=T=S > D=A=N > Y=C 
47 27 17 14 10 9 7 6 6 6 5 4 4 4 3 2 

Table 2 (Continued) 
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Tryptophan 94 FS;R660W 



D > E=S > C=A=Q=T =N> G > K=P=V=Y=W=H > I = L » F 
10 77 443332 1111 1 0.9 0.6 0.1 

*Ratio > 1 means improved TET(II)*ddCTP incorporation. Must be "85" to be "transparent to the 
enzyme. 

**Ratio = 1 means wild-type activity. 
***Ratio < 1 means activity worse than wild-type. 



Table 2 (Continued) 
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CLAIMS 

What is claimed is: 

5 1. A DNA polymerase having at least one mutation in the nucleotide label 

interaction region, wherein the DNA polymerase has reduced discrimination for fluorescein- 
type dye labeled nucleotides. 

2. A DNA polymerase according to claim 1 , wherein the mutation is in portion 
10 of the nucleotide-label interaction region selected from the group consisting of (i)the O- 

helix, (ii) the K helix, and (iii) the inter O-P helical loop. 

3. A DNA polymerase according to claim 2, wherein the mutation is segment of 
the enzyme corresponding to amino acid residue selected from the group consisting of E520. 

15 A531, L522, R523, E524, A525, H526, P527, I 528, V529, E530, K531, 1532, R536, E537, 

R573, Q582, N583, V586, R587, P589, Q592, R593, R595, D610, T612, Q613, E615, 
R636, D637, T640, F647, V654, D655, P656, L657, R659, R660, T664, E681, L682, A683, 
1684, P685, E688, F692, Q754, H784, L817, E820, L828, K831,and E832. 

20 4. A DNA polymerase according to claim 3, wherein the mutation is at a position 

selected from the group consisting of, R595, D655, R660, T664 and E681. 

5 . A DNA polymerase according to claim 4, wherein the DNA polymerase is Taq 
DNA polymerase. 

25 

6. A DNA polymerase according to claim 5, wherein the mutation is selected 
from the group consisting of R660D, D655L, E618G, and R595E. 

7 A DNA polymerase according to claim 6, wherein comprising a mutation set 
30 belonging to the group consisting of (G46D, R660D, F667Y), (G46D, R595D, R660D, 

F667Y), and (G46D, R660D, F667Y, E681G), and (G46D, F667Y, E681G). 
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8. A DNA polymerase according to claim 2, wherein the DNA polymerase is a 
thermostable DNA polymerase. 



9. A polynucleotide encoding a DNA polymerase according to claim 1 . 

10. An expression vector having a promoter, wherein the vector comprises a 
polynucleotide according to claim 1 in functional combination with the promoter. 

11. A host cell comprising an expression vector according to claim 10. 

12. A method of synthesizing a fluorescent ly labeled polynucleotide, said method 
comprising the step of mixing a DNA polymerase according to claim 1 with a primed 
template. 

13. A method according to claim 12, wherein the primed template is a primed 
template in a chain termination sequencing reaction. 

14. A method according to claim 12, wherein the primed template is a primed 
template in a polymerase chain reaction. 

15. A kit for fluorescently labeling a labeling a polynucleotide, the kit comprising 
a DNA polymerase according to claim 1 and a fluorescently labeled nucleotide. 
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Figure 2 
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